An application of the temporal difference algorithm to the truck backer-upper problem
نویسندگان
چکیده
We use a reinforcement learning approach to learn a real world control problem, the truck backer-upper problem. In this problem, a tractor trailer truck must be backed into a loading dock from an arbitrary location and orientation. Our approach uses the temporal difference algorithm using a neural network as the value function approximator. The novelty of this work is the simplicity of our implementation, yet it is able to successfully back the truck into the loading dock from random initial locations and orientations.
منابع مشابه
A Mathematical Model and Grouping Imperialist Competitive Algorithm for Integrated Quay Crane and Yard Truck Scheduling Problem with Non-crossing Constraint
In this research, an integrated approach is presented to simultaneously solve quay crane scheduling and yard truck scheduling problems. A mathematical model was proposed considering the main real-world assumptions such as quay crane non-crossing, precedence constraints and variable berthing times for vessels with the aim of minimizing vessels completion time. Based on the numerical results, thi...
متن کاملFuzzy Logic in Control: Truck Backer-Upper Problem Revisited
-Truck backer-upper problem, considered an acknowledged benchmark in nonlinear system identification, is an excellent test-bed for fuzzy control systems. Fuzzy controller, formulated on the basis of human understanding of the process or identified from measured control actions, can be regarded as an emulator of human operator. Controller design, however, may become difficult, especially if the ...
متن کاملA Genetic Approach to the Truck Backer Upper Problem and the Inter-twined Spiral Problem
Neural networks are a biologically motivated problem-solving paradigm that has proven successful in robustly solving a variety of problems. This paper describes another biologically motivated paradigm, namely genetic programming, which can also solve a variety of problems. This paper explains genetic programming and applies it to two well-known benchmark problems from the field of neural networ...
متن کاملGenerate Fuzzy Membership Function using Particle Swarm Optimization
In this paper, we will proposed a hybrid method to generate fuzzy membership function automatically. Particle Swarm Optimization (PSO) is used as optimized algorithm, supplement the performance of fuzzy system. The PSO is able to generate an optimal set of parameter for the membership functions automatic adjustment. Fuzzy control system that automatically backs up a truck to a specified point o...
متن کاملQuay Cranes and Yard Trucks Scheduling Problem at Container Terminals
A bi-objective mathematical model is developed to simultaneously consider the quay crane and yard truck scheduling problems at container terminals. Main real-world assumptions, such as quay cranes with non-crossing constraints, quay cranes’ safety margins and precedence constraints are considered in this model. This integrated approach leads to better efficiency and productivity at container te...
متن کامل